Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 10866 |
| Missing cells | 13434 |
| Missing cells (%) | 5.9% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 1.7 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Text | 10 |
| DateTime | 1 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
id is highly overall correlated with release_year | High correlation |
popularity is highly overall correlated with budget and 4 other fields | High correlation |
budget is highly overall correlated with popularity and 4 other fields | High correlation |
revenue is highly overall correlated with popularity and 4 other fields | High correlation |
vote_count is highly overall correlated with popularity and 4 other fields | High correlation |
release_year is highly overall correlated with id | High correlation |
budget_adj is highly overall correlated with popularity and 4 other fields | High correlation |
revenue_adj is highly overall correlated with popularity and 4 other fields | High correlation |
homepage has 7930 (73.0%) missing values | Missing |
tagline has 2824 (26.0%) missing values | Missing |
keywords has 1493 (13.7%) missing values | Missing |
production_companies has 1030 (9.5%) missing values | Missing |
budget has 5696 (52.4%) zeros | Zeros |
revenue has 6016 (55.4%) zeros | Zeros |
budget_adj has 5696 (52.4%) zeros | Zeros |
revenue_adj has 6016 (55.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-10-11 23:31:49.173566 |
|---|---|
| Analysis finished | 2023-10-11 23:31:56.240453 |
| Duration | 7.07 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10865 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66064.177 |
| Minimum | 5 |
|---|---|
| Maximum | 417859 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 1221 |
| Q1 | 10596.25 |
| median | 20669 |
| Q3 | 75610 |
| 95-th percentile | 288556 |
| Maximum | 417859 |
| Range | 417854 |
| Interquartile range (IQR) | 65013.75 |
Descriptive statistics
| Standard deviation | 92130.137 |
|---|---|
| Coefficient of variation (CV) | 1.3945551 |
| Kurtosis | 1.781869 |
| Mean | 66064.177 |
| Median Absolute Deviation (MAD) | 15121.5 |
| Skewness | 1.7322939 |
| Sum | 7.1785335 × 108 |
| Variance | 8.4879621 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42194 | 2 | < 0.1% |
| 135397 | 1 | < 0.1% |
| 9534 | 1 | < 0.1% |
| 70476 | 1 | < 0.1% |
| 44345 | 1 | < 0.1% |
| 16358 | 1 | < 0.1% |
| 20304 | 1 | < 0.1% |
| 20544 | 1 | < 0.1% |
| 18442 | 1 | < 0.1% |
| 49870 | 1 | < 0.1% |
| Other values (10855) | 10855 |
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 6 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 16 | 1 | |
| 17 | 1 | |
| 18 | 1 | |
| 20 | 1 |
| Value | Count | Frequency (%) |
| 417859 | 1 | |
| 414419 | 1 | |
| 409696 | 1 | |
| 395883 | 1 | |
| 395560 | 1 | |
| 386501 | 1 | |
| 382517 | 1 | |
| 378373 | 1 | |
| 376823 | 1 | |
| 374430 | 1 |
imdb_id
Text
| Distinct | 10855 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 10 |
| Missing (%) | 0.1% |
| Memory size | 85.0 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 97704 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10854 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | tt0369610 |
|---|---|
| 2nd row | tt1392190 |
| 3rd row | tt2908446 |
| 4th row | tt2488496 |
| 5th row | tt2820852 |
| Value | Count | Frequency (%) |
| tt0411951 | 2 | < 0.1% |
| tt3659388 | 1 | < 0.1% |
| tt1798684 | 1 | < 0.1% |
| tt1964418 | 1 | < 0.1% |
| tt1951266 | 1 | < 0.1% |
| tt2908446 | 1 | < 0.1% |
| tt2488496 | 1 | < 0.1% |
| tt2820852 | 1 | < 0.1% |
| tt1663202 | 1 | < 0.1% |
| tt1340138 | 1 | < 0.1% |
| Other values (10845) | 10845 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 21712 | |
| 0 | 15108 | |
| 1 | 10229 | |
| 2 | 7559 | 7.7% |
| 3 | 6667 | 6.8% |
| 4 | 6546 | 6.7% |
| 8 | 6328 | 6.5% |
| 7 | 6087 | 6.2% |
| 9 | 6077 | 6.2% |
| 6 | 5822 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 75992 | |
| Lowercase Letter | 21712 | 22.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15108 | |
| 1 | 10229 | |
| 2 | 7559 | |
| 3 | 6667 | |
| 4 | 6546 | |
| 8 | 6328 | |
| 7 | 6087 | |
| 9 | 6077 | |
| 6 | 5822 | 7.7% |
| 5 | 5569 | 7.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 21712 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75992 | |
| Latin | 21712 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15108 | |
| 1 | 10229 | |
| 2 | 7559 | |
| 3 | 6667 | |
| 4 | 6546 | |
| 8 | 6328 | |
| 7 | 6087 | |
| 9 | 6077 | |
| 6 | 5822 | 7.7% |
| 5 | 5569 | 7.3% |
Latin
| Value | Count | Frequency (%) |
| t | 21712 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97704 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 21712 | |
| 0 | 15108 | |
| 1 | 10229 | |
| 2 | 7559 | 7.7% |
| 3 | 6667 | 6.8% |
| 4 | 6546 | 6.7% |
| 8 | 6328 | 6.5% |
| 7 | 6087 | 6.2% |
| 9 | 6077 | 6.2% |
| 6 | 5822 | 6.0% |
popularity
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 10814 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.64644095 |
| Minimum | 6.5 × 10-5 |
|---|---|
| Maximum | 32.985763 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 6.5 × 10-5 |
|---|---|
| 5-th percentile | 0.06425225 |
| Q1 | 0.20758275 |
| median | 0.3838555 |
| Q3 | 0.713817 |
| 95-th percentile | 2.0466017 |
| Maximum | 32.985763 |
| Range | 32.985698 |
| Interquartile range (IQR) | 0.50623425 |
Descriptive statistics
| Standard deviation | 1.0001849 |
|---|---|
| Coefficient of variation (CV) | 1.5472178 |
| Kurtosis | 210.99813 |
| Mean | 0.64644095 |
| Median Absolute Deviation (MAD) | 0.2153785 |
| Skewness | 9.8763313 |
| Sum | 7024.2274 |
| Variance | 1.0003699 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.028143 | 2 | < 0.1% |
| 0.144297 | 2 | < 0.1% |
| 0.158021 | 2 | < 0.1% |
| 0.430191 | 2 | < 0.1% |
| 0.210808 | 2 | < 0.1% |
| 0.22758 | 2 | < 0.1% |
| 0.623706 | 2 | < 0.1% |
| 0.109305 | 2 | < 0.1% |
| 0.247926 | 2 | < 0.1% |
| 0.326556 | 2 | < 0.1% |
| Other values (10804) | 10846 |
| Value | Count | Frequency (%) |
| 6.5 × 10-5 | 1 | |
| 0.000188 | 1 | |
| 0.00062 | 1 | |
| 0.000973 | 1 | |
| 0.001115 | 1 | |
| 0.001117 | 1 | |
| 0.001315 | 1 | |
| 0.001317 | 1 | |
| 0.001349 | 1 | |
| 0.001372 | 1 |
| Value | Count | Frequency (%) |
| 32.985763 | 1 | |
| 28.419936 | 1 | |
| 24.949134 | 1 | |
| 14.311205 | 1 | |
| 13.112507 | 1 | |
| 12.971027 | 1 | |
| 12.037933 | 1 | |
| 11.422751 | 1 | |
| 11.173104 | 1 | |
| 10.739009 | 1 |
budget
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 557 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14625701 |
| Minimum | 0 |
|---|---|
| Maximum | 4.25 × 108 |
| Zeros | 5696 |
| Zeros (%) | 52.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 15000000 |
| 95-th percentile | 75000000 |
| Maximum | 4.25 × 108 |
| Range | 4.25 × 108 |
| Interquartile range (IQR) | 15000000 |
Descriptive statistics
| Standard deviation | 30913214 |
|---|---|
| Coefficient of variation (CV) | 2.1136227 |
| Kurtosis | 19.269436 |
| Mean | 14625701 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.7172371 |
| Sum | 1.5892287 × 1011 |
| Variance | 9.5562679 × 1014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5696 | |
| 20000000 | 190 | 1.7% |
| 15000000 | 183 | 1.7% |
| 25000000 | 178 | 1.6% |
| 10000000 | 176 | 1.6% |
| 30000000 | 165 | 1.5% |
| 5000000 | 141 | 1.3% |
| 40000000 | 134 | 1.2% |
| 35000000 | 128 | 1.2% |
| 12000000 | 120 | 1.1% |
| Other values (547) | 3755 |
| Value | Count | Frequency (%) |
| 0 | 5696 | |
| 1 | 4 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 3 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 10 | 6 | 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 425000000 | 1 | < 0.1% |
| 380000000 | 1 | < 0.1% |
| 300000000 | 1 | < 0.1% |
| 280000000 | 1 | < 0.1% |
| 270000000 | 1 | < 0.1% |
| 260000000 | 2 | < 0.1% |
| 258000000 | 1 | < 0.1% |
| 255000000 | 1 | < 0.1% |
| 250000000 | 7 | |
| 245000000 | 1 | < 0.1% |
revenue
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 4702 |
|---|---|
| Distinct (%) | 43.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39823320 |
| Minimum | 0 |
|---|---|
| Maximum | 2.7815058 × 109 |
| Zeros | 6016 |
| Zeros (%) | 55.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 24000000 |
| 95-th percentile | 2.1367216 × 108 |
| Maximum | 2.7815058 × 109 |
| Range | 2.7815058 × 109 |
| Interquartile range (IQR) | 24000000 |
Descriptive statistics
| Standard deviation | 1.1700349 × 108 |
|---|---|
| Coefficient of variation (CV) | 2.9380646 |
| Kurtosis | 73.168489 |
| Mean | 39823320 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.6583972 |
| Sum | 4.3272019 × 1011 |
| Variance | 1.3689816 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6016 | |
| 12000000 | 10 | 0.1% |
| 10000000 | 8 | 0.1% |
| 11000000 | 7 | 0.1% |
| 6000000 | 6 | 0.1% |
| 2000000 | 6 | 0.1% |
| 5000000 | 6 | 0.1% |
| 30000000 | 5 | < 0.1% |
| 20000000 | 5 | < 0.1% |
| 14000000 | 5 | < 0.1% |
| Other values (4692) | 4792 |
| Value | Count | Frequency (%) |
| 0 | 6016 | |
| 2 | 2 | < 0.1% |
| 3 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 12 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2781505847 | 1 | |
| 2068178225 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405035767 | 1 | |
| 1327817822 | 1 | |
| 1274219009 | 1 | |
| 1215439994 | 1 |
original_title
Text
| Distinct | 10571 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 70 |
| Mean length | 16.002209 |
| Min length | 1 |
Characters and Unicode
| Total characters | 173880 |
|---|---|
| Distinct characters | 164 |
| Distinct categories | 19 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 10294 ? |
|---|---|
| Unique (%) | 94.7% |
Sample
| 1st row | Jurassic World |
|---|---|
| 2nd row | Mad Max: Fury Road |
| 3rd row | Insurgent |
| 4th row | Star Wars: The Force Awakens |
| 5th row | Furious 7 |
| Value | Count | Frequency (%) |
| the | 3279 | 10.6% |
| of | 969 | 3.1% |
| a | 386 | 1.2% |
| in | 327 | 1.1% |
| and | 317 | 1.0% |
| to | 227 | 0.7% |
| 2 | 226 | 0.7% |
| 211 | 0.7% | |
| man | 148 | 0.5% |
| for | 113 | 0.4% |
| Other values (8859) | 24843 |
Most occurring characters
| Value | Count | Frequency (%) |
| 20178 | 11.6% | |
| e | 17617 | 10.1% |
| a | 10758 | 6.2% |
| o | 10239 | 5.9% |
| r | 9356 | 5.4% |
| n | 9343 | 5.4% |
| i | 9062 | 5.2% |
| t | 8414 | 4.8% |
| s | 6857 | 3.9% |
| h | 6463 | 3.7% |
| Other values (154) | 65593 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 122010 | |
| Uppercase Letter | 27184 | 15.6% |
| Space Separator | 20195 | 11.6% |
| Other Punctuation | 2612 | 1.5% |
| Decimal Number | 1140 | 0.7% |
| Dash Punctuation | 212 | 0.1% |
| Modifier Symbol | 114 | 0.1% |
| Other Symbol | 101 | 0.1% |
| Currency Symbol | 78 | < 0.1% |
| Other Number | 71 | < 0.1% |
| Other values (9) | 163 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17617 | |
| a | 10758 | 8.8% |
| o | 10239 | 8.4% |
| r | 9356 | 7.7% |
| n | 9343 | 7.7% |
| i | 9062 | 7.4% |
| t | 8414 | 6.9% |
| s | 6857 | 5.6% |
| h | 6463 | 5.3% |
| l | 5845 | 4.8% |
| Other values (35) | 28056 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3616 | 13.3% |
| S | 2228 | 8.2% |
| B | 1815 | 6.7% |
| M | 1783 | 6.6% |
| C | 1613 | 5.9% |
| D | 1579 | 5.8% |
| A | 1541 | 5.7% |
| L | 1281 | 4.7% |
| H | 1244 | 4.6% |
| P | 1197 | 4.4% |
| Other values (27) | 9287 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1097 | |
| ' | 560 | |
| . | 368 | 14.1% |
| , | 154 | 5.9% |
| & | 153 | 5.9% |
| ! | 123 | 4.7% |
| ? | 41 | 1.6% |
| / | 29 | 1.1% |
| ¡ | 14 | 0.5% |
| • | 12 | 0.5% |
| Other values (14) | 61 | 2.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 327 | |
| 3 | 172 | |
| 1 | 172 | |
| 0 | 167 | |
| 4 | 81 | 7.1% |
| 5 | 67 | 5.9% |
| 9 | 48 | 4.2% |
| 7 | 42 | 3.7% |
| 6 | 35 | 3.1% |
| 8 | 29 | 2.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 29 | |
| ¤ | 18 | |
| ¢ | 12 | |
| £ | 10 | 12.8% |
| ¥ | 5 | 6.4% |
| $ | 4 | 5.1% |
Other Number
| Value | Count | Frequency (%) |
| ¹ | 22 | |
| ³ | 14 | |
| ¼ | 14 | |
| ½ | 9 | |
| ² | 7 | 9.9% |
| ¾ | 5 | 7.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¸ | 63 | |
| ¨ | 24 | 21.1% |
| ´ | 14 | 12.3% |
| ˜ | 9 | 7.9% |
| ¯ | 4 | 3.5% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 60 | |
| ™ | 20 | 19.8% |
| ° | 11 | 10.9% |
| ¦ | 7 | 6.9% |
| ® | 3 | 3.0% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 9 | |
| ¬ | 8 | |
| × | 5 | |
| + | 3 | 12.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 8 | |
| ” | 7 | |
| ’ | 3 | 14.3% |
| › | 3 | 14.3% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 7 | |
| ‹ | 7 | |
| « | 5 | |
| “ | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 206 | |
| — | 5 | 2.4% |
| – | 1 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 | |
| ‚ | 14 | |
| „ | 7 |
Space Separator
| Value | Count | Frequency (%) |
| 20178 | ||
| 17 | 0.1% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 13 | |
| º | 8 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 11 |
Format
| Value | Count | Frequency (%) |
| | 6 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149210 | |
| Common | 24670 | 14.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17617 | 11.8% |
| a | 10758 | 7.2% |
| o | 10239 | 6.9% |
| r | 9356 | 6.3% |
| n | 9343 | 6.3% |
| i | 9062 | 6.1% |
| t | 8414 | 5.6% |
| s | 6857 | 4.6% |
| h | 6463 | 4.3% |
| l | 5845 | 3.9% |
| Other values (73) | 55256 |
Common
| Value | Count | Frequency (%) |
| 20178 | ||
| : | 1097 | 4.4% |
| ' | 560 | 2.3% |
| . | 368 | 1.5% |
| 2 | 327 | 1.3% |
| - | 206 | 0.8% |
| 3 | 172 | 0.7% |
| 1 | 172 | 0.7% |
| 0 | 167 | 0.7% |
| , | 154 | 0.6% |
| Other values (71) | 1269 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172797 | |
| None | 915 | 0.5% |
| Punctuation | 99 | 0.1% |
| Currency Symbols | 29 | < 0.1% |
| Letterlike Symbols | 20 | < 0.1% |
| Modifier Letters | 20 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 20178 | 11.7% | |
| e | 17617 | 10.2% |
| a | 10758 | 6.2% |
| o | 10239 | 5.9% |
| r | 9356 | 5.4% |
| n | 9343 | 5.4% |
| i | 9062 | 5.2% |
| t | 8414 | 4.9% |
| s | 6857 | 4.0% |
| h | 6463 | 3.7% |
| Other values (73) | 64510 |
None
| Value | Count | Frequency (%) |
| Ã | 162 | 17.7% |
| ¸ | 63 | 6.9% |
| © | 60 | 6.6% |
| à | 50 | 5.5% |
| ì | 31 | 3.4% |
| ¨ | 24 | 2.6% |
| ¹ | 22 | 2.4% |
| å | 21 | 2.3% |
| ¤ | 18 | 2.0% |
| ã | 17 | 1.9% |
| Other values (52) | 447 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 29 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 20 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 14 | |
| • | 12 | |
| ‰ | 11 | |
| ‡ | 8 | |
| ‘ | 7 | |
| … | 7 | |
| ‹ | 7 | |
| „ | 7 | |
| ” | 7 | |
| “ | 5 | 5.1% |
| Other values (5) | 14 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 11 | |
| ˜ | 9 |
cast
Text
| Distinct | 10719 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 76 |
| Missing (%) | 0.7% |
| Memory size | 85.0 KiB |
Length
| Max length | 110 |
|---|---|
| Median length | 94 |
| Mean length | 67.872567 |
| Min length | 7 |
Characters and Unicode
| Total characters | 732345 |
|---|---|
| Distinct characters | 126 |
| Distinct categories | 16 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 10666 ? |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | Chris Pratt|Bryce Dallas Howard|Irrfan Khan|Vincent D'Onofrio|Nick Robinson |
|---|---|
| 2nd row | Tom Hardy|Charlize Theron|Hugh Keays-Byrne|Nicholas Hoult|Josh Helman |
| 3rd row | Shailene Woodley|Theo James|Kate Winslet|Ansel Elgort|Miles Teller |
| 4th row | Harrison Ford|Mark Hamill|Carrie Fisher|Adam Driver|Daisy Ridley |
| 5th row | Vin Diesel|Paul Walker|Jason Statham|Michelle Rodriguez|Dwayne Johnson |
| Value | Count | Frequency (%) |
| michael | 285 | 0.4% |
| john | 231 | 0.3% |
| de | 209 | 0.3% |
| james | 177 | 0.3% |
| robert | 158 | 0.2% |
| tom | 154 | 0.2% |
| lee | 139 | 0.2% |
| jason | 132 | 0.2% |
| van | 131 | 0.2% |
| david | 128 | 0.2% |
| Other values (46644) | 64967 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 65672 | 9.0% |
| a | 61498 | 8.4% |
| 55922 | 7.6% | |
| n | 50507 | 6.9% |
| r | 44415 | 6.1% |
| i | 42812 | 5.8% |
| | | 41783 | 5.7% |
| o | 37725 | 5.2% |
| l | 35122 | 4.8% |
| t | 24677 | 3.4% |
| Other values (116) | 272212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 517877 | |
| Uppercase Letter | 112598 | 15.4% |
| Space Separator | 55935 | 7.6% |
| Math Symbol | 41835 | 5.7% |
| Other Punctuation | 2123 | 0.3% |
| Dash Punctuation | 811 | 0.1% |
| Other Symbol | 591 | 0.1% |
| Format | 130 | < 0.1% |
| Modifier Symbol | 119 | < 0.1% |
| Other Number | 107 | < 0.1% |
| Other values (6) | 219 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 65672 | |
| a | 61498 | |
| n | 50507 | |
| r | 44415 | 8.6% |
| i | 42812 | 8.3% |
| o | 37725 | 7.3% |
| l | 35122 | 6.8% |
| t | 24677 | 4.8% |
| s | 24214 | 4.7% |
| h | 19623 | 3.8% |
| Other values (24) | 111612 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 9756 | 8.7% |
| J | 9025 | 8.0% |
| S | 8852 | 7.9% |
| C | 8638 | 7.7% |
| B | 8209 | 7.3% |
| D | 7012 | 6.2% |
| R | 6503 | 5.8% |
| A | 6314 | 5.6% |
| L | 5342 | 4.7% |
| H | 5002 | 4.4% |
| Other values (24) | 37945 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1300 | |
| ' | 481 | 22.7% |
| ¡ | 147 | 6.9% |
| , | 49 | 2.3% |
| ¶ | 43 | 2.0% |
| § | 36 | 1.7% |
| ‡ | 28 | 1.3% |
| … | 16 | 0.8% |
| ‰ | 10 | 0.5% |
| " | 6 | 0.3% |
| Other values (5) | 7 | 0.3% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 50 | |
| ¼ | 49 | |
| ² | 4 | 3.7% |
| ¾ | 2 | 1.9% |
| ¹ | 1 | 0.9% |
| ½ | 1 | 0.9% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 567 | |
| ™ | 11 | 1.9% |
| ® | 9 | 1.5% |
| ° | 3 | 0.5% |
| ¦ | 1 | 0.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¨ | 53 | |
| ¯ | 28 | |
| ¸ | 24 | |
| ´ | 13 | 10.9% |
| ˜ | 1 | 0.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¥ | 39 | |
| ¤ | 20 | |
| £ | 6 | 8.3% |
| € | 4 | 5.6% |
| ¢ | 3 | 4.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 56 | |
| “ | 6 | 9.4% |
| ‹ | 1 | 1.6% |
| ‘ | 1 | 1.6% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 41783 | |
| ± | 50 | 0.1% |
| ¬ | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 804 | |
| – | 6 | 0.7% |
| — | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12 | |
| 5 | 12 | |
| 2 | 1 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 55922 | ||
| 13 | < 0.1% |
Other Letter
| Value | Count | Frequency (%) |
| º | 30 | |
| ª | 2 | 6.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 11 | |
| › | 4 | 26.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| „ | 7 | |
| ‚ | 4 |
Format
| Value | Count | Frequency (%) |
| | 130 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 630506 | |
| Common | 101839 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 65672 | 10.4% |
| a | 61498 | 9.8% |
| n | 50507 | 8.0% |
| r | 44415 | 7.0% |
| i | 42812 | 6.8% |
| o | 37725 | 6.0% |
| l | 35122 | 5.6% |
| t | 24677 | 3.9% |
| s | 24214 | 3.8% |
| h | 19623 | 3.1% |
| Other values (59) | 224241 |
Common
| Value | Count | Frequency (%) |
| 55922 | ||
| | | 41783 | |
| . | 1300 | 1.3% |
| - | 804 | 0.8% |
| © | 567 | 0.6% |
| ' | 481 | 0.5% |
| ¡ | 147 | 0.1% |
| | 130 | 0.1% |
| « | 56 | 0.1% |
| ¨ | 53 | 0.1% |
| Other values (47) | 596 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 729305 | |
| None | 2939 | 0.4% |
| Punctuation | 85 | < 0.1% |
| Letterlike Symbols | 11 | < 0.1% |
| Currency Symbols | 4 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 65672 | 9.0% |
| a | 61498 | 8.4% |
| 55922 | 7.7% | |
| n | 50507 | 6.9% |
| r | 44415 | 6.1% |
| i | 42812 | 5.9% |
| | | 41783 | 5.7% |
| o | 37725 | 5.2% |
| l | 35122 | 4.8% |
| t | 24677 | 3.4% |
| Other values (54) | 269172 |
None
| Value | Count | Frequency (%) |
| Ã | 1409 | |
| © | 567 | |
| ¡ | 147 | 5.0% |
| | 130 | 4.4% |
| « | 56 | 1.9% |
| ¨ | 53 | 1.8% |
| ± | 50 | 1.7% |
| ³ | 50 | 1.7% |
| ¼ | 49 | 1.7% |
| Ä | 48 | 1.6% |
| Other values (37) | 380 | 12.9% |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 28 | |
| … | 16 | |
| ‰ | 10 | 11.8% |
| „ | 7 | 8.2% |
| “ | 6 | 7.1% |
| – | 6 | 7.1% |
| ‚ | 4 | 4.7% |
| › | 4 | 4.7% |
| ‹ | 1 | 1.2% |
| — | 1 | 1.2% |
| Other values (2) | 2 | 2.4% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 11 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 4 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˜ | 1 |
homepage
Text
MISSING 
| Distinct | 2896 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 7930 |
| Missing (%) | 73.0% |
| Memory size | 85.0 KiB |
Length
| Max length | 242 |
|---|---|
| Median length | 89 |
| Mean length | 37.146117 |
| Min length | 13 |
Characters and Unicode
| Total characters | 109061 |
|---|---|
| Distinct characters | 83 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 2868 ? |
|---|---|
| Unique (%) | 97.7% |
Sample
| 1st row | http://www.jurassicworld.com/ |
|---|---|
| 2nd row | http://www.madmaxmovie.com/ |
| 3rd row | http://www.thedivergentseries.movie/#insurgent |
| 4th row | http://www.starwars.com/films/star-wars-episode-vii |
| 5th row | http://www.furious7.com/ |
| Value | Count | Frequency (%) |
| http://www.missionimpossible.com | 5 | 0.2% |
| http://www.thehungergames.movie | 4 | 0.1% |
| http://www.transformersmovie.com | 4 | 0.1% |
| http://phantasm.com | 4 | 0.1% |
| http://www.kungfupanda.com | 4 | 0.1% |
| http://www.georgecarlin.com | 3 | 0.1% |
| http://www.lordoftherings.net | 3 | 0.1% |
| http://www.americanreunionmovie.com | 3 | 0.1% |
| http://www.thehobbit.com | 3 | 0.1% |
| http://www.jeffdunham.com | 3 | 0.1% |
| Other values (2878) | 2900 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 9865 | 9.0% |
| / | 9843 | 9.0% |
| e | 7545 | 6.9% |
| w | 7528 | 6.9% |
| o | 7522 | 6.9% |
| m | 6035 | 5.5% |
| . | 5844 | 5.4% |
| h | 5380 | 4.9% |
| i | 5336 | 4.9% |
| c | 4475 | 4.1% |
| Other values (73) | 39688 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 86844 | |
| Other Punctuation | 18787 | 17.2% |
| Dash Punctuation | 1265 | 1.2% |
| Decimal Number | 1247 | 1.1% |
| Uppercase Letter | 648 | 0.6% |
| Connector Punctuation | 176 | 0.2% |
| Math Symbol | 81 | 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9865 | 11.4% |
| e | 7545 | 8.7% |
| w | 7528 | 8.7% |
| o | 7522 | 8.7% |
| m | 6035 | 6.9% |
| h | 5380 | 6.2% |
| i | 5336 | 6.1% |
| c | 4475 | 5.2% |
| p | 4174 | 4.8% |
| a | 3960 | 4.6% |
| Other values (17) | 25024 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 61 | 9.4% |
| S | 59 | 9.1% |
| M | 54 | 8.3% |
| A | 46 | 7.1% |
| E | 40 | 6.2% |
| F | 36 | 5.6% |
| B | 35 | 5.4% |
| D | 34 | 5.2% |
| H | 28 | 4.3% |
| C | 27 | 4.2% |
| Other values (17) | 228 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 9843 | |
| . | 5844 | |
| : | 2941 | 15.7% |
| ? | 59 | 0.3% |
| # | 40 | 0.2% |
| % | 26 | 0.1% |
| & | 18 | 0.1% |
| ! | 8 | < 0.1% |
| , | 6 | < 0.1% |
| ' | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 240 | |
| 0 | 199 | |
| 1 | 196 | |
| 3 | 141 | |
| 4 | 91 | 7.3% |
| 8 | 82 | 6.6% |
| 9 | 78 | 6.3% |
| 7 | 75 | 6.0% |
| 5 | 74 | 5.9% |
| 6 | 71 | 5.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 76 | |
| + | 5 | 6.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | |
| { | 1 | 16.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 | |
| } | 1 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1265 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 176 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 87492 | |
| Common | 21569 | 19.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 9865 | 11.3% |
| e | 7545 | 8.6% |
| w | 7528 | 8.6% |
| o | 7522 | 8.6% |
| m | 6035 | 6.9% |
| h | 5380 | 6.1% |
| i | 5336 | 6.1% |
| c | 4475 | 5.1% |
| p | 4174 | 4.8% |
| a | 3960 | 4.5% |
| Other values (44) | 25672 |
Common
| Value | Count | Frequency (%) |
| / | 9843 | |
| . | 5844 | |
| : | 2941 | 13.6% |
| - | 1265 | 5.9% |
| 2 | 240 | 1.1% |
| 0 | 199 | 0.9% |
| 1 | 196 | 0.9% |
| _ | 176 | 0.8% |
| 3 | 141 | 0.7% |
| 4 | 91 | 0.4% |
| Other values (19) | 633 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 109058 | |
| None | 2 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 9865 | 9.0% |
| / | 9843 | 9.0% |
| e | 7545 | 6.9% |
| w | 7528 | 6.9% |
| o | 7522 | 6.9% |
| m | 6035 | 5.5% |
| . | 5844 | 5.4% |
| h | 5380 | 4.9% |
| i | 5336 | 4.9% |
| c | 4475 | 4.1% |
| Other values (70) | 39685 |
None
| Value | Count | Frequency (%) |
| â | 1 | |
| Ž | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
director
Text
| Distinct | 5067 |
|---|---|
| Distinct (%) | 46.8% |
| Missing | 44 |
| Missing (%) | 0.4% |
| Memory size | 85.0 KiB |
Length
| Max length | 533 |
|---|---|
| Median length | 169 |
| Mean length | 14.558122 |
| Min length | 2 |
Characters and Unicode
| Total characters | 157548 |
|---|---|
| Distinct characters | 96 |
| Distinct categories | 18 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 5 ? |
Unique
| Unique | 3217 ? |
|---|---|
| Unique (%) | 29.7% |
Sample
| 1st row | Colin Trevorrow |
|---|---|
| 2nd row | George Miller |
| 3rd row | Robert Schwentke |
| 4th row | J.J. Abrams |
| 5th row | James Wan |
| Value | Count | Frequency (%) |
| john | 436 | 1.8% |
| michael | 308 | 1.3% |
| david | 301 | 1.3% |
| robert | 212 | 0.9% |
| peter | 201 | 0.8% |
| james | 162 | 0.7% |
| richard | 159 | 0.7% |
| paul | 144 | 0.6% |
| mark | 110 | 0.5% |
| lee | 107 | 0.5% |
| Other values (6202) | 21600 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14631 | 9.3% |
| 12934 | 8.2% | |
| a | 12669 | 8.0% |
| n | 11046 | 7.0% |
| r | 10688 | 6.8% |
| o | 9160 | 5.8% |
| i | 9108 | 5.8% |
| l | 7511 | 4.8% |
| t | 5475 | 3.5% |
| s | 5207 | 3.3% |
| Other values (86) | 59119 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 116535 | |
| Uppercase Letter | 25718 | 16.3% |
| Space Separator | 12936 | 8.2% |
| Math Symbol | 1088 | 0.7% |
| Other Punctuation | 839 | 0.5% |
| Dash Punctuation | 178 | 0.1% |
| Other Symbol | 118 | 0.1% |
| Format | 35 | < 0.1% |
| Other Number | 32 | < 0.1% |
| Modifier Symbol | 28 | < 0.1% |
| Other values (8) | 41 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2357 | 9.2% |
| J | 2208 | 8.6% |
| M | 2207 | 8.6% |
| R | 1800 | 7.0% |
| B | 1686 | 6.6% |
| C | 1612 | 6.3% |
| D | 1484 | 5.8% |
| A | 1433 | 5.6% |
| G | 1264 | 4.9% |
| L | 1223 | 4.8% |
| Other values (20) | 8444 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14631 | |
| a | 12669 | |
| n | 11046 | |
| r | 10688 | |
| o | 9160 | 7.9% |
| i | 9108 | 7.8% |
| l | 7511 | 6.4% |
| t | 5475 | 4.7% |
| s | 5207 | 4.5% |
| h | 4545 | 3.9% |
| Other values (16) | 26495 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 645 | |
| ¡ | 65 | 7.7% |
| ' | 52 | 6.2% |
| ¶ | 30 | 3.6% |
| , | 18 | 2.1% |
| § | 15 | 1.8% |
| ‰ | 5 | 0.6% |
| ‡ | 5 | 0.6% |
| … | 4 | 0.5% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¨ | 12 | |
| ´ | 9 | |
| ¸ | 5 | |
| ¯ | 2 | 7.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1070 | |
| ± | 17 | 1.6% |
| ¬ | 1 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 112 | |
| ¦ | 5 | 4.2% |
| ™ | 1 | 0.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¥ | 11 | |
| ¤ | 7 | |
| € | 1 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| 12934 | ||
| 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 177 | |
| – | 1 | 0.6% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 27 | |
| ¼ | 5 | 15.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ‚ | 4 | |
| ( | 1 | 20.0% |
Other Letter
| Value | Count | Frequency (%) |
| º | 3 | |
| ª | 1 | 25.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 3 | |
| ‘ | 1 | 25.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 3 | |
| › | 1 | 25.0% |
Format
| Value | Count | Frequency (%) |
| | 35 |
Control
| Value | Count | Frequency (%) |
| 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 142257 | |
| Common | 15291 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14631 | 10.3% |
| a | 12669 | 8.9% |
| n | 11046 | 7.8% |
| r | 10688 | 7.5% |
| o | 9160 | 6.4% |
| i | 9108 | 6.4% |
| l | 7511 | 5.3% |
| t | 5475 | 3.8% |
| s | 5207 | 3.7% |
| h | 4545 | 3.2% |
| Other values (48) | 52217 |
Common
| Value | Count | Frequency (%) |
| 12934 | ||
| | | 1070 | 7.0% |
| . | 645 | 4.2% |
| - | 177 | 1.2% |
| © | 112 | 0.7% |
| ¡ | 65 | 0.4% |
| ' | 52 | 0.3% |
| | 35 | 0.2% |
| ¶ | 30 | 0.2% |
| ³ | 27 | 0.2% |
| Other values (28) | 144 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 156746 | |
| None | 779 | 0.5% |
| Punctuation | 21 | < 0.1% |
| Letterlike Symbols | 1 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14631 | 9.3% |
| 12934 | 8.3% | |
| a | 12669 | 8.1% |
| n | 11046 | 7.0% |
| r | 10688 | 6.8% |
| o | 9160 | 5.8% |
| i | 9108 | 5.8% |
| l | 7511 | 4.8% |
| t | 5475 | 3.5% |
| s | 5207 | 3.3% |
| Other values (52) | 58317 |
None
| Value | Count | Frequency (%) |
| Ã | 377 | |
| © | 112 | 14.4% |
| ¡ | 65 | 8.3% |
| | 35 | 4.5% |
| ¶ | 30 | 3.9% |
| ³ | 27 | 3.5% |
| ± | 17 | 2.2% |
| Å | 15 | 1.9% |
| § | 15 | 1.9% |
| Ä | 14 | 1.8% |
| Other values (15) | 72 | 9.2% |
Punctuation
| Value | Count | Frequency (%) |
| ‰ | 5 | |
| ‡ | 5 | |
| … | 4 | |
| ‚ | 4 | |
| – | 1 | 4.8% |
| › | 1 | 4.8% |
| ‘ | 1 | 4.8% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
tagline
Text
MISSING 
| Distinct | 7997 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 2824 |
| Missing (%) | 26.0% |
| Memory size | 85.0 KiB |
Length
| Max length | 286 |
|---|---|
| Median length | 166 |
| Mean length | 44.174459 |
| Min length | 1 |
Characters and Unicode
| Total characters | 355251 |
|---|---|
| Distinct characters | 126 |
| Distinct categories | 16 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 7957 ? |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | The park is open. |
|---|---|
| 2nd row | What a Lovely Day. |
| 3rd row | One Choice Can Destroy You |
| 4th row | Every generation has a story. |
| 5th row | Vengeance Hits Home |
| Value | Count | Frequency (%) |
| the | 3989 | 6.1% |
| a | 2410 | 3.7% |
| to | 1460 | 2.2% |
| of | 1335 | 2.0% |
| is | 1247 | 1.9% |
| you | 1095 | 1.7% |
| in | 951 | 1.5% |
| and | 804 | 1.2% |
| one | 619 | 0.9% |
| it | 611 | 0.9% |
| Other values (7019) | 50857 |
Most occurring characters
| Value | Count | Frequency (%) |
| 57392 | ||
| e | 36795 | 10.4% |
| t | 22060 | 6.2% |
| o | 21674 | 6.1% |
| a | 18810 | 5.3% |
| n | 17991 | 5.1% |
| i | 17300 | 4.9% |
| r | 16801 | 4.7% |
| s | 16059 | 4.5% |
| h | 14126 | 4.0% |
| Other values (116) | 116243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 258603 | |
| Space Separator | 57393 | 16.2% |
| Uppercase Letter | 21147 | 6.0% |
| Other Punctuation | 16563 | 4.7% |
| Decimal Number | 950 | 0.3% |
| Dash Punctuation | 364 | 0.1% |
| Currency Symbol | 96 | < 0.1% |
| Other Symbol | 83 | < 0.1% |
| Open Punctuation | 17 | < 0.1% |
| Close Punctuation | 15 | < 0.1% |
| Other values (6) | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 36795 | |
| t | 22060 | 8.5% |
| o | 21674 | 8.4% |
| a | 18810 | 7.3% |
| n | 17991 | 7.0% |
| i | 17300 | 6.7% |
| r | 16801 | 6.5% |
| s | 16059 | 6.2% |
| h | 14126 | 5.5% |
| l | 11024 | 4.3% |
| Other values (26) | 65963 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3179 | |
| A | 1962 | 9.3% |
| S | 1542 | 7.3% |
| H | 1327 | 6.3% |
| I | 1262 | 6.0% |
| W | 1211 | 5.7% |
| B | 1038 | 4.9% |
| F | 899 | 4.3% |
| N | 896 | 4.2% |
| E | 889 | 4.2% |
| Other values (25) | 6942 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10962 | |
| ' | 2278 | 13.8% |
| , | 1587 | 9.6% |
| ! | 1041 | 6.3% |
| ? | 481 | 2.9% |
| " | 78 | 0.5% |
| : | 40 | 0.2% |
| * | 25 | 0.2% |
| & | 22 | 0.1% |
| # | 16 | 0.1% |
| Other values (8) | 33 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 266 | |
| 1 | 183 | |
| 2 | 109 | |
| 3 | 80 | 8.4% |
| 9 | 76 | 8.0% |
| 4 | 54 | 5.7% |
| 5 | 54 | 5.7% |
| 6 | 49 | 5.2% |
| 7 | 45 | 4.7% |
| 8 | 34 | 3.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 | |
| [ | 2 | 11.8% |
| „ | 1 | 5.9% |
| ‚ | 1 | 5.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˜ | 1 | |
| ¯ | 1 | |
| ´ | 1 | |
| ` | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 88 | |
| $ | 7 | 7.3% |
| ¤ | 1 | 1.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 48 | |
| ¦ | 34 | |
| © | 1 | 1.2% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 | |
| + | 2 | |
| | | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 57392 | ||
| 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 | |
| ] | 2 | 13.3% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 | |
| ² | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 364 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 3 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 279750 | |
| Common | 75501 | 21.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 36795 | |
| t | 22060 | 7.9% |
| o | 21674 | 7.7% |
| a | 18810 | 6.7% |
| n | 17991 | 6.4% |
| i | 17300 | 6.2% |
| r | 16801 | 6.0% |
| s | 16059 | 5.7% |
| h | 14126 | 5.0% |
| l | 11024 | 3.9% |
| Other values (61) | 87110 |
Common
| Value | Count | Frequency (%) |
| 57392 | ||
| . | 10962 | 14.5% |
| ' | 2278 | 3.0% |
| , | 1587 | 2.1% |
| ! | 1041 | 1.4% |
| ? | 481 | 0.6% |
| - | 364 | 0.5% |
| 0 | 266 | 0.4% |
| 1 | 183 | 0.2% |
| 2 | 109 | 0.1% |
| Other values (45) | 838 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 354937 | |
| None | 166 | < 0.1% |
| Currency Symbols | 88 | < 0.1% |
| Letterlike Symbols | 48 | < 0.1% |
| Punctuation | 8 | < 0.1% |
| Modifier Letters | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 57392 | ||
| e | 36795 | 10.4% |
| t | 22060 | 6.2% |
| o | 21674 | 6.1% |
| a | 18810 | 5.3% |
| n | 17991 | 5.1% |
| i | 17300 | 4.9% |
| r | 16801 | 4.7% |
| s | 16059 | 4.5% |
| h | 14126 | 4.0% |
| Other values (78) | 115929 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 88 |
None
| Value | Count | Frequency (%) |
| â | 86 | |
| ¦ | 34 | 20.5% |
| Â | 5 | 3.0% |
| ã | 4 | 2.4% |
| œ | 3 | 1.8% |
| å | 3 | 1.8% |
| ƒ | 3 | 1.8% |
| Š | 2 | 1.2% |
| ½ | 2 | 1.2% |
| É | 2 | 1.2% |
| Other values (18) | 22 | 13.3% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 48 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 3 | |
| ˜ | 1 | 25.0% |
Punctuation
| Value | Count | Frequency (%) |
| ‰ | 2 | |
| “ | 2 | |
| ‡ | 1 | |
| „ | 1 | |
| … | 1 | |
| ‚ | 1 |
keywords
Text
MISSING 
| Distinct | 8804 |
|---|---|
| Distinct (%) | 93.9% |
| Missing | 1493 |
| Missing (%) | 13.7% |
| Memory size | 85.0 KiB |
Length
| Max length | 131 |
|---|---|
| Median length | 88 |
| Mean length | 41.972474 |
| Min length | 2 |
Characters and Unicode
| Total characters | 393408 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 17 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 5 ? |
Unique
| Unique | 8673 ? |
|---|---|
| Unique (%) | 92.5% |
Sample
| 1st row | monster|dna|tyrannosaurus rex|velociraptor|island |
|---|---|
| 2nd row | future|chase|post-apocalyptic|dystopia|australia |
| 3rd row | based on novel|revolution|dystopia|sequel|dystopic future |
| 4th row | android|spaceship|jedi|space opera|3d |
| 5th row | car race|speed|revenge|suspense|car |
| Value | Count | Frequency (%) |
| of | 599 | 2.2% |
| on | 546 | 2.0% |
| director | 357 | 1.3% |
| film | 231 | 0.9% |
| and | 231 | 0.9% |
| new | 203 | 0.8% |
| in | 190 | 0.7% |
| the | 177 | 0.7% |
| brother | 164 | 0.6% |
| based | 163 | 0.6% |
| Other values (16457) | 24006 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 37321 | 9.5% |
| i | 29650 | 7.5% |
| a | 29180 | 7.4% |
| | | 28077 | 7.1% |
| r | 27999 | 7.1% |
| o | 25158 | 6.4% |
| n | 24736 | 6.3% |
| t | 23340 | 5.9% |
| s | 22514 | 5.7% |
| 17513 | 4.5% | |
| Other values (77) | 127920 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 346243 | |
| Math Symbol | 28078 | 7.1% |
| Space Separator | 17536 | 4.5% |
| Dash Punctuation | 570 | 0.1% |
| Other Punctuation | 423 | 0.1% |
| Decimal Number | 366 | 0.1% |
| Uppercase Letter | 73 | < 0.1% |
| Open Punctuation | 30 | < 0.1% |
| Close Punctuation | 28 | < 0.1% |
| Other Symbol | 21 | < 0.1% |
| Other values (7) | 40 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 37321 | 10.8% |
| i | 29650 | 8.6% |
| a | 29180 | 8.4% |
| r | 27999 | 8.1% |
| o | 25158 | 7.3% |
| n | 24736 | 7.1% |
| t | 23340 | 6.7% |
| s | 22514 | 6.5% |
| l | 17116 | 4.9% |
| c | 13962 | 4.0% |
| Other values (27) | 95267 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 246 | |
| ' | 156 | |
| ¶ | 7 | 1.7% |
| § | 3 | 0.7% |
| • | 3 | 0.7% |
| & | 2 | 0.5% |
| ¡ | 2 | 0.5% |
| , | 1 | 0.2% |
| … | 1 | 0.2% |
| † | 1 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 82 | |
| 0 | 70 | |
| 9 | 68 | |
| 7 | 59 | |
| 3 | 41 | |
| 2 | 14 | 3.8% |
| 5 | 12 | 3.3% |
| 6 | 12 | 3.3% |
| 8 | 4 | 1.1% |
| 4 | 4 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Ã | 42 | |
| Â | 23 | |
| Ÿ | 5 | 6.8% |
| Î | 2 | 2.7% |
| Š | 1 | 1.4% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 17 | |
| ¦ | 3 | 14.3% |
| ° | 1 | 4.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 6 | |
| € | 5 | |
| ¥ | 2 | 15.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 | |
| ˜ | 2 | |
| ¸ | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 28077 | |
| ¬ | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 17513 | ||
| 23 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 28 | |
| ‚ | 2 | 6.7% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 | |
| « | 1 | 14.3% |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 3 | |
| ³ | 1 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 570 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 28 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 4 |
Other Letter
| Value | Count | Frequency (%) |
| º | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 346319 | |
| Common | 47089 | 12.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| | | 28077 | |
| 17513 | ||
| - | 570 | 1.2% |
| . | 246 | 0.5% |
| ' | 156 | 0.3% |
| 1 | 82 | 0.2% |
| 0 | 70 | 0.1% |
| 9 | 68 | 0.1% |
| 7 | 59 | 0.1% |
| 3 | 41 | 0.1% |
| Other values (34) | 207 | 0.4% |
Latin
| Value | Count | Frequency (%) |
| e | 37321 | 10.8% |
| i | 29650 | 8.6% |
| a | 29180 | 8.4% |
| r | 27999 | 8.1% |
| o | 25158 | 7.3% |
| n | 24736 | 7.1% |
| t | 23340 | 6.7% |
| s | 22514 | 6.5% |
| l | 17116 | 4.9% |
| c | 13962 | 4.0% |
| Other values (33) | 95343 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 393202 | |
| None | 182 | < 0.1% |
| Punctuation | 13 | < 0.1% |
| Modifier Letters | 6 | < 0.1% |
| Currency Symbols | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 37321 | 9.5% |
| i | 29650 | 7.5% |
| a | 29180 | 7.4% |
| | | 28077 | 7.1% |
| r | 27999 | 7.1% |
| o | 25158 | 6.4% |
| n | 24736 | 6.3% |
| t | 23340 | 5.9% |
| s | 22514 | 5.7% |
| 17513 | 4.5% | |
| Other values (35) | 127714 |
None
| Value | Count | Frequency (%) |
| Ã | 42 | |
| Â | 23 | |
| 23 | ||
| © | 17 | 9.3% |
| å | 8 | 4.4% |
| ¶ | 7 | 3.8% |
| ¤ | 6 | 3.3% |
| Ÿ | 5 | 2.7% |
| â | 5 | 2.7% |
| ¦ | 3 | 1.6% |
| Other values (24) | 43 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 6 | |
| • | 3 | |
| ‚ | 2 | 15.4% |
| … | 1 | 7.7% |
| † | 1 | 7.7% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 5 |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 4 | |
| ˜ | 2 |
overview
Text
| Distinct | 10847 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 85.0 KiB |
Length
| Max length | 1000 |
|---|---|
| Median length | 738 |
| Mean length | 307.03756 |
| Min length | 13 |
Characters and Unicode
| Total characters | 3335042 |
|---|---|
| Distinct characters | 144 |
| Distinct categories | 20 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 6 ? |
Unique
| Unique | 10843 ? |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | Twenty-two years after the events of Jurassic Park, Isla Nublar now features a fully functioning dinosaur theme park, Jurassic World, as originally envisioned by John Hammond. |
|---|---|
| 2nd row | An apocalyptic story set in the furthest reaches of our planet, in a stark desert landscape where humanity is broken, and most everyone is crazed fighting for the necessities of life. Within this world exist two rebels on the run who just might be able to restore order. There's Max, a man of action and a man of few words, who seeks peace of mind following the loss of his wife and child in the aftermath of the chaos. And Furiosa, a woman of action and a woman who believes her path to survival may be achieved if she can make it across the desert back to her childhood homeland. |
| 3rd row | Beatrice Prior must confront her inner demons and continue her fight against a powerful alliance which threatens to tear her society apart. |
| 4th row | Thirty years after defeating the Galactic Empire, Han Solo and his allies face a new threat from the evil Kylo Ren and his army of Stormtroopers. |
| 5th row | Deckard Shaw seeks revenge against Dominic Toretto and his family for his comatose brother. |
| Value | Count | Frequency (%) |
| the | 31655 | 5.6% |
| a | 23240 | 4.1% |
| to | 17549 | 3.1% |
| and | 16956 | 3.0% |
| of | 15597 | 2.7% |
| in | 10160 | 1.8% |
| his | 8607 | 1.5% |
| is | 7865 | 1.4% |
| with | 5514 | 1.0% |
| her | 4793 | 0.8% |
| Other values (38551) | 425509 |
Most occurring characters
| Value | Count | Frequency (%) |
| 556977 | ||
| e | 320395 | 9.6% |
| t | 219691 | 6.6% |
| a | 215612 | 6.5% |
| i | 194998 | 5.8% |
| o | 191213 | 5.7% |
| n | 191104 | 5.7% |
| s | 177760 | 5.3% |
| r | 174774 | 5.2% |
| h | 139782 | 4.2% |
| Other values (134) | 952736 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2591110 | |
| Space Separator | 557010 | 16.7% |
| Uppercase Letter | 90606 | 2.7% |
| Other Punctuation | 70522 | 2.1% |
| Dash Punctuation | 9126 | 0.3% |
| Decimal Number | 8887 | 0.3% |
| Open Punctuation | 1981 | 0.1% |
| Close Punctuation | 1977 | 0.1% |
| Currency Symbol | 1893 | 0.1% |
| Other Symbol | 1256 | < 0.1% |
| Other values (10) | 674 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10241 | 11.3% |
| T | 7658 | 8.5% |
| S | 6972 | 7.7% |
| B | 6009 | 6.6% |
| C | 5663 | 6.3% |
| M | 5455 | 6.0% |
| W | 4604 | 5.1% |
| H | 4191 | 4.6% |
| D | 4027 | 4.4% |
| J | 3560 | 3.9% |
| Other values (23) | 32226 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 320395 | |
| t | 219691 | 8.5% |
| a | 215612 | 8.3% |
| i | 194998 | 7.5% |
| o | 191213 | 7.4% |
| n | 191104 | 7.4% |
| s | 177760 | 6.9% |
| r | 174774 | 6.7% |
| h | 139782 | 5.4% |
| l | 111503 | 4.3% |
| Other values (21) | 654278 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 30277 | |
| . | 27504 | |
| ' | 7736 | 11.0% |
| " | 2437 | 3.5% |
| : | 793 | 1.1% |
| ? | 575 | 0.8% |
| ; | 471 | 0.7% |
| ! | 416 | 0.6% |
| / | 151 | 0.2% |
| & | 89 | 0.1% |
| Other values (11) | 73 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2014 | |
| 0 | 1884 | |
| 9 | 1229 | |
| 2 | 987 | |
| 5 | 508 | 5.7% |
| 3 | 489 | 5.5% |
| 8 | 477 | 5.4% |
| 7 | 463 | 5.2% |
| 4 | 430 | 4.8% |
| 6 | 406 | 4.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 1756 | |
| $ | 99 | 5.2% |
| ¢ | 15 | 0.8% |
| £ | 12 | 0.6% |
| ¤ | 7 | 0.4% |
| ¥ | 4 | 0.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˜ | 48 | |
| ¨ | 30 | |
| ¯ | 12 | 11.8% |
| ´ | 8 | 7.8% |
| ` | 3 | 2.9% |
| ¸ | 1 | 1.0% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 11 | |
| ¼ | 10 | |
| ¹ | 4 | 12.5% |
| ² | 3 | 9.4% |
| ¾ | 2 | 6.2% |
| ½ | 2 | 6.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 883 | |
| © | 283 | 22.5% |
| ¦ | 71 | 5.7% |
| ® | 18 | 1.4% |
| ° | 1 | 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 174 | |
| » | 5 | 2.7% |
| ’ | 3 | 1.6% |
| › | 1 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 4 | |
| ¬ | 3 | |
| = | 2 | |
| | | 1 | 10.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1967 | |
| [ | 10 | 0.5% |
| „ | 4 | 0.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 295 | |
| « | 7 | 2.3% |
| ‹ | 3 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 556977 | ||
| 33 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9124 | |
| — | 2 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1967 | |
| ] | 10 | 0.5% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 10 | |
| º | 3 | 23.1% |
Control
| Value | Count | Frequency (%) |
| 13 |
Format
| Value | Count | Frequency (%) |
| | 13 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2681728 | |
| Common | 653314 | 19.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 556977 | ||
| , | 30277 | 4.6% |
| . | 27504 | 4.2% |
| - | 9124 | 1.4% |
| ' | 7736 | 1.2% |
| " | 2437 | 0.4% |
| 1 | 2014 | 0.3% |
| ( | 1967 | 0.3% |
| ) | 1967 | 0.3% |
| 0 | 1884 | 0.3% |
| Other values (69) | 11427 | 1.7% |
Latin
| Value | Count | Frequency (%) |
| e | 320395 | |
| t | 219691 | 8.2% |
| a | 215612 | 8.0% |
| i | 194998 | 7.3% |
| o | 191213 | 7.1% |
| n | 191104 | 7.1% |
| s | 177760 | 6.6% |
| r | 174774 | 6.5% |
| h | 139782 | 5.2% |
| l | 111503 | 4.2% |
| Other values (55) | 744896 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3328794 | |
| None | 3070 | 0.1% |
| Currency Symbols | 1756 | 0.1% |
| Letterlike Symbols | 883 | < 0.1% |
| Punctuation | 490 | < 0.1% |
| Modifier Letters | 49 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 556977 | ||
| e | 320395 | 9.6% |
| t | 219691 | 6.6% |
| a | 215612 | 6.5% |
| i | 194998 | 5.9% |
| o | 191213 | 5.7% |
| n | 191104 | 5.7% |
| s | 177760 | 5.3% |
| r | 174774 | 5.3% |
| h | 139782 | 4.2% |
| Other values (77) | 946488 |
None
| Value | Count | Frequency (%) |
| â | 1762 | |
| Ã | 489 | 15.9% |
| © | 283 | 9.2% |
| œ | 137 | 4.5% |
| ¦ | 71 | 2.3% |
| Â | 49 | 1.6% |
| 33 | 1.1% | |
| ¨ | 30 | 1.0% |
| ¡ | 23 | 0.7% |
| ® | 18 | 0.6% |
| Other values (32) | 175 | 5.7% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1756 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 883 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 295 | |
| ” | 174 | |
| „ | 4 | 0.8% |
| ‹ | 3 | 0.6% |
| • | 3 | 0.6% |
| ’ | 3 | 0.6% |
| … | 2 | 0.4% |
| — | 2 | 0.4% |
| ‰ | 2 | 0.4% |
| † | 1 | 0.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˜ | 48 | |
| ˆ | 1 | 2.0% |
runtime
Real number (ℝ)
| Distinct | 247 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.07086 |
| Minimum | 0 |
|---|---|
| Maximum | 900 |
| Zeros | 31 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 75 |
| Q1 | 90 |
| median | 99 |
| Q3 | 111 |
| 95-th percentile | 139 |
| Maximum | 900 |
| Range | 900 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 31.381405 |
|---|---|
| Coefficient of variation (CV) | 0.30744724 |
| Kurtosis | 116.23757 |
| Mean | 102.07086 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 6.1037928 |
| Sum | 1109102 |
| Variance | 984.79258 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 547 | 5.0% |
| 95 | 358 | 3.3% |
| 100 | 335 | 3.1% |
| 93 | 328 | 3.0% |
| 97 | 306 | 2.8% |
| 96 | 300 | 2.8% |
| 91 | 297 | 2.7% |
| 94 | 292 | 2.7% |
| 98 | 270 | 2.5% |
| 92 | 270 | 2.5% |
| Other values (237) | 7563 |
| Value | Count | Frequency (%) |
| 0 | 31 | |
| 2 | 5 | < 0.1% |
| 3 | 11 | 0.1% |
| 4 | 17 | |
| 5 | 17 | |
| 6 | 22 | |
| 7 | 17 | |
| 8 | 9 | 0.1% |
| 9 | 7 | 0.1% |
| 10 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 900 | 1 | |
| 877 | 1 | |
| 705 | 1 | |
| 566 | 1 | |
| 561 | 1 | |
| 550 | 1 | |
| 540 | 1 | |
| 501 | 1 | |
| 500 | 1 | |
| 470 | 1 |
genres
Text
| Distinct | 2039 |
|---|---|
| Distinct (%) | 18.8% |
| Missing | 23 |
| Missing (%) | 0.2% |
| Memory size | 85.0 KiB |
Length
| Max length | 51 |
|---|---|
| Median length | 44 |
| Mean length | 18.537121 |
| Min length | 3 |
Characters and Unicode
| Total characters | 200998 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1225 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | Action|Adventure|Science Fiction|Thriller |
|---|---|
| 2nd row | Action|Adventure|Science Fiction|Thriller |
| 3rd row | Adventure|Science Fiction|Thriller |
| 4th row | Action|Adventure|Science Fiction|Fantasy |
| 5th row | Action|Crime|Thriller |
| Value | Count | Frequency (%) |
| comedy | 712 | 5.8% |
| drama | 712 | 5.8% |
| fiction | 670 | 5.5% |
| documentary | 312 | 2.5% |
| drama|romance | 289 | 2.4% |
| comedy|drama | 280 | 2.3% |
| comedy|romance | 268 | 2.2% |
| horror|thriller | 259 | 2.1% |
| horror | 253 | 2.1% |
| comedy|drama|romance | 222 | 1.8% |
| Other values (1900) | 8263 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 20601 | 10.2% |
| e | 17185 | 8.5% |
| | | 16117 | 8.0% |
| a | 15786 | 7.9% |
| o | 14302 | 7.1% |
| m | 14071 | 7.0% |
| i | 14064 | 7.0% |
| n | 11215 | 5.6% |
| c | 8715 | 4.3% |
| t | 8530 | 4.2% |
| Other values (20) | 60412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 154960 | |
| Uppercase Letter | 28524 | 14.2% |
| Math Symbol | 16117 | 8.0% |
| Space Separator | 1397 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 20601 | |
| e | 17185 | |
| a | 15786 | |
| o | 14302 | |
| m | 14071 | |
| i | 14064 | |
| n | 11215 | |
| c | 8715 | 5.6% |
| t | 8530 | 5.5% |
| y | 8414 | 5.4% |
| Other values (7) | 22077 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 5281 | |
| C | 5148 | |
| A | 4555 | |
| F | 3565 | |
| T | 3075 | |
| H | 1971 | 6.9% |
| R | 1712 | 6.0% |
| M | 1385 | 4.9% |
| S | 1230 | 4.3% |
| W | 435 | 1.5% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 16117 |
Space Separator
| Value | Count | Frequency (%) |
| 1397 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 183484 | |
| Common | 17514 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 20601 | |
| e | 17185 | 9.4% |
| a | 15786 | 8.6% |
| o | 14302 | 7.8% |
| m | 14071 | 7.7% |
| i | 14064 | 7.7% |
| n | 11215 | 6.1% |
| c | 8715 | 4.7% |
| t | 8530 | 4.6% |
| y | 8414 | 4.6% |
| Other values (18) | 50601 |
Common
| Value | Count | Frequency (%) |
| | | 16117 | |
| 1397 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 200998 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 20601 | 10.2% |
| e | 17185 | 8.5% |
| | | 16117 | 8.0% |
| a | 15786 | 7.9% |
| o | 14302 | 7.1% |
| m | 14071 | 7.0% |
| i | 14064 | 7.0% |
| n | 11215 | 5.6% |
| c | 8715 | 4.3% |
| t | 8530 | 4.2% |
| Other values (20) | 60412 |
MISSING 
| Distinct | 7445 |
|---|---|
| Distinct (%) | 75.7% |
| Missing | 1030 |
| Missing (%) | 9.5% |
| Memory size | 85.0 KiB |
Length
| Max length | 184 |
|---|---|
| Median length | 128 |
| Mean length | 45.516165 |
| Min length | 3 |
Characters and Unicode
| Total characters | 447697 |
|---|---|
| Distinct characters | 120 |
| Distinct categories | 17 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 6850 ? |
|---|---|
| Unique (%) | 69.6% |
Sample
| 1st row | Universal Studios|Amblin Entertainment|Legendary Pictures|Fuji Television Network|Dentsu |
|---|---|
| 2nd row | Village Roadshow Pictures|Kennedy Miller Productions |
| 3rd row | Summit Entertainment|Mandeville Films|Red Wagon Entertainment|NeoReel |
| 4th row | Lucasfilm|Truenorth Productions|Bad Robot |
| 5th row | Universal Pictures|Original Film|Media Rights Capital|Dentsu|One Race Films |
| Value | Count | Frequency (%) |
| pictures | 1880 | 4.3% |
| productions | 1750 | 4.0% |
| films | 1571 | 3.6% |
| entertainment | 1187 | 2.7% |
| film | 1183 | 2.7% |
| universal | 512 | 1.2% |
| fox | 480 | 1.1% |
| paramount | 442 | 1.0% |
| century | 421 | 1.0% |
| columbia | 412 | 0.9% |
| Other values (12817) | 34206 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 35180 | 7.9% |
| 34207 | 7.6% | |
| e | 33390 | 7.5% |
| n | 31750 | 7.1% |
| t | 30796 | 6.9% |
| r | 29487 | 6.6% |
| o | 26480 | 5.9% |
| a | 24331 | 5.4% |
| s | 21969 | 4.9% |
| l | 15887 | 3.5% |
| Other values (110) | 164220 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 329907 | |
| Uppercase Letter | 63468 | 14.2% |
| Space Separator | 34213 | 7.6% |
| Math Symbol | 13544 | 3.0% |
| Other Punctuation | 2192 | 0.5% |
| Decimal Number | 1645 | 0.4% |
| Dash Punctuation | 849 | 0.2% |
| Open Punctuation | 721 | 0.2% |
| Close Punctuation | 720 | 0.2% |
| Other Symbol | 320 | 0.1% |
| Other values (7) | 118 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 35180 | |
| e | 33390 | |
| n | 31750 | |
| t | 30796 | |
| r | 29487 | |
| o | 26480 | |
| a | 24331 | 7.4% |
| s | 21969 | 6.7% |
| l | 15887 | 4.8% |
| u | 15355 | 4.7% |
| Other values (21) | 65282 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 10513 | |
| F | 7597 | |
| C | 5813 | 9.2% |
| M | 3907 | 6.2% |
| E | 3880 | 6.1% |
| S | 3731 | 5.9% |
| B | 2962 | 4.7% |
| T | 2804 | 4.4% |
| A | 2700 | 4.3% |
| G | 2331 | 3.7% |
| Other values (20) | 17230 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1411 | |
| / | 273 | 12.5% |
| & | 214 | 9.8% |
| , | 119 | 5.4% |
| ' | 88 | 4.0% |
| ‰ | 20 | 0.9% |
| ¡ | 18 | 0.8% |
| ¶ | 11 | 0.5% |
| ! | 11 | 0.5% |
| § | 7 | 0.3% |
| Other values (10) | 20 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 418 | |
| 0 | 401 | |
| 1 | 204 | |
| 4 | 170 | |
| 3 | 145 | 8.8% |
| 9 | 83 | 5.0% |
| 6 | 69 | 4.2% |
| 7 | 63 | 3.8% |
| 8 | 49 | 3.0% |
| 5 | 43 | 2.6% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¨ | 8 | |
| ´ | 3 | 18.8% |
| ¯ | 3 | 18.8% |
| ¸ | 1 | 6.2% |
| ˜ | 1 | 6.2% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 13391 | |
| + | 132 | 1.0% |
| ± | 21 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 719 | |
| „ | 1 | 0.1% |
| [ | 1 | 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 20 | |
| ¢ | 2 | 8.7% |
| ¥ | 1 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 34207 | ||
| 6 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 848 | |
| – | 1 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 719 | |
| ] | 1 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 314 | |
| ° | 6 | 1.9% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 48 | |
| ¼ | 5 | 9.4% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 3 | |
| º | 2 |
Format
| Value | Count | Frequency (%) |
| | 18 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 393377 | |
| Common | 54320 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 35180 | 8.9% |
| e | 33390 | 8.5% |
| n | 31750 | 8.1% |
| t | 30796 | 7.8% |
| r | 29487 | 7.5% |
| o | 26480 | 6.7% |
| a | 24331 | 6.2% |
| s | 21969 | 5.6% |
| l | 15887 | 4.0% |
| u | 15355 | 3.9% |
| Other values (52) | 128752 |
Common
| Value | Count | Frequency (%) |
| 34207 | ||
| | | 13391 | 24.7% |
| . | 1411 | 2.6% |
| - | 848 | 1.6% |
| ) | 719 | 1.3% |
| ( | 719 | 1.3% |
| 2 | 418 | 0.8% |
| 0 | 401 | 0.7% |
| © | 314 | 0.6% |
| / | 273 | 0.5% |
| Other values (48) | 1619 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 446644 | |
| None | 1027 | 0.2% |
| Punctuation | 25 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 35180 | 7.9% |
| 34207 | 7.7% | |
| e | 33390 | 7.5% |
| n | 31750 | 7.1% |
| t | 30796 | 6.9% |
| r | 29487 | 6.6% |
| o | 26480 | 5.9% |
| a | 24331 | 5.4% |
| s | 21969 | 4.9% |
| l | 15887 | 3.6% |
| Other values (75) | 163167 |
None
| Value | Count | Frequency (%) |
| Ã | 514 | |
| © | 314 | |
| ³ | 48 | 4.7% |
| ± | 21 | 2.0% |
| ¤ | 20 | 1.9% |
| ¡ | 18 | 1.8% |
| | 18 | 1.8% |
| ¶ | 11 | 1.1% |
| ¨ | 8 | 0.8% |
| § | 7 | 0.7% |
| Other values (18) | 48 | 4.7% |
Punctuation
| Value | Count | Frequency (%) |
| ‰ | 20 | |
| – | 1 | 4.0% |
| • | 1 | 4.0% |
| ” | 1 | 4.0% |
| „ | 1 | 4.0% |
| … | 1 | 4.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˜ | 1 |
release_date
Date
| Distinct | 5909 |
|---|---|
| Distinct (%) | 54.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 85.0 KiB |
| Minimum | 1973-01-01 00:00:00 |
|---|---|
| Maximum | 2072-12-19 00:00:00 |
vote_count
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1289 |
|---|---|
| Distinct (%) | 11.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 217.38975 |
| Minimum | 10 |
|---|---|
| Maximum | 9767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 17 |
| median | 38 |
| Q3 | 145.75 |
| 95-th percentile | 1025.75 |
| Maximum | 9767 |
| Range | 9757 |
| Interquartile range (IQR) | 128.75 |
Descriptive statistics
| Standard deviation | 575.61906 |
|---|---|
| Coefficient of variation (CV) | 2.6478666 |
| Kurtosis | 53.360979 |
| Mean | 217.38975 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 6.1773058 |
| Sum | 2362157 |
| Variance | 331337.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 501 | 4.6% |
| 11 | 474 | 4.4% |
| 12 | 422 | 3.9% |
| 13 | 377 | 3.5% |
| 14 | 323 | 3.0% |
| 15 | 300 | 2.8% |
| 16 | 270 | 2.5% |
| 17 | 256 | 2.4% |
| 18 | 218 | 2.0% |
| 19 | 189 | 1.7% |
| Other values (1279) | 7536 |
| Value | Count | Frequency (%) |
| 10 | 501 | |
| 11 | 474 | |
| 12 | 422 | |
| 13 | 377 | |
| 14 | 323 | |
| 15 | 300 | |
| 16 | 270 | |
| 17 | 256 | |
| 18 | 218 | |
| 19 | 189 | 1.7% |
| Value | Count | Frequency (%) |
| 9767 | 1 | |
| 8903 | 1 | |
| 8458 | 1 | |
| 8432 | 1 | |
| 7375 | 1 | |
| 7080 | 1 | |
| 6882 | 1 | |
| 6723 | 1 | |
| 6498 | 1 | |
| 6417 | 1 |
vote_average
Real number (ℝ)
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.9749218 |
| Minimum | 1.5 |
|---|---|
| Maximum | 9.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 1.5 |
|---|---|
| 5-th percentile | 4.4 |
| Q1 | 5.4 |
| median | 6 |
| Q3 | 6.6 |
| 95-th percentile | 7.4 |
| Maximum | 9.2 |
| Range | 7.7 |
| Interquartile range (IQR) | 1.2 |
Descriptive statistics
| Standard deviation | 0.93514182 |
|---|---|
| Coefficient of variation (CV) | 0.15651114 |
| Kurtosis | 0.54350325 |
| Mean | 5.9749218 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -0.43590798 |
| Sum | 64923.5 |
| Variance | 0.87449021 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.1 | 496 | 4.6% |
| 6 | 495 | 4.6% |
| 5.8 | 486 | 4.5% |
| 5.9 | 473 | 4.4% |
| 6.2 | 464 | 4.3% |
| 6.3 | 461 | 4.2% |
| 6.5 | 457 | 4.2% |
| 6.4 | 446 | 4.1% |
| 5.7 | 415 | 3.8% |
| 6.6 | 413 | 3.8% |
| Other values (62) | 6260 |
| Value | Count | Frequency (%) |
| 1.5 | 2 | < 0.1% |
| 2 | 1 | < 0.1% |
| 2.1 | 3 | |
| 2.2 | 3 | |
| 2.3 | 2 | < 0.1% |
| 2.4 | 7 | |
| 2.5 | 2 | < 0.1% |
| 2.6 | 3 | |
| 2.7 | 3 | |
| 2.8 | 7 |
| Value | Count | Frequency (%) |
| 9.2 | 1 | < 0.1% |
| 8.9 | 1 | < 0.1% |
| 8.8 | 2 | < 0.1% |
| 8.7 | 1 | < 0.1% |
| 8.6 | 1 | < 0.1% |
| 8.5 | 6 | 0.1% |
| 8.4 | 10 | |
| 8.3 | 10 | |
| 8.2 | 6 | 0.1% |
| 8.1 | 16 |
release_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2001.3227 |
| Minimum | 1960 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 1960 |
|---|---|
| 5-th percentile | 1973 |
| Q1 | 1995 |
| median | 2006 |
| Q3 | 2011 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 55 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 12.812941 |
|---|---|
| Coefficient of variation (CV) | 0.0064022363 |
| Kurtosis | 0.80005132 |
| Mean | 2001.3227 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -1.2042543 |
| Sum | 21746372 |
| Variance | 164.17145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 700 | 6.4% |
| 2013 | 659 | 6.1% |
| 2015 | 629 | 5.8% |
| 2012 | 588 | 5.4% |
| 2011 | 540 | 5.0% |
| 2009 | 533 | 4.9% |
| 2008 | 496 | 4.6% |
| 2010 | 490 | 4.5% |
| 2007 | 438 | 4.0% |
| 2006 | 408 | 3.8% |
| Other values (46) | 5385 |
| Value | Count | Frequency (%) |
| 1960 | 32 | |
| 1961 | 31 | |
| 1962 | 32 | |
| 1963 | 34 | |
| 1964 | 42 | |
| 1965 | 35 | |
| 1966 | 46 | |
| 1967 | 40 | |
| 1968 | 39 | |
| 1969 | 31 |
| Value | Count | Frequency (%) |
| 2015 | 629 | |
| 2014 | 700 | |
| 2013 | 659 | |
| 2012 | 588 | |
| 2011 | 540 | |
| 2010 | 490 | |
| 2009 | 533 | |
| 2008 | 496 | |
| 2007 | 438 | |
| 2006 | 408 |
budget_adj
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2614 |
|---|---|
| Distinct (%) | 24.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17551040 |
| Minimum | 0 |
|---|---|
| Maximum | 4.25 × 108 |
| Zeros | 5696 |
| Zeros (%) | 52.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20853251 |
| 95-th percentile | 89375138 |
| Maximum | 4.25 × 108 |
| Range | 4.25 × 108 |
| Interquartile range (IQR) | 20853251 |
Descriptive statistics
| Standard deviation | 34306156 |
|---|---|
| Coefficient of variation (CV) | 1.9546509 |
| Kurtosis | 13.036952 |
| Mean | 17551040 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.1149199 |
| Sum | 1.907096 × 1011 |
| Variance | 1.1769123 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5696 | |
| 10164004.34 | 17 | 0.2% |
| 21033371.65 | 17 | 0.2% |
| 20000000 | 16 | 0.1% |
| 4605455.254 | 15 | 0.1% |
| 24234951.06 | 14 | 0.1% |
| 33496898.69 | 14 | 0.1% |
| 40656017.36 | 13 | 0.1% |
| 20328008.68 | 13 | 0.1% |
| 26291714.57 | 13 | 0.1% |
| Other values (2604) | 5038 |
| Value | Count | Frequency (%) |
| 0 | 5696 | |
| 0.9210910508 | 1 | < 0.1% |
| 0.9693980426 | 1 | < 0.1% |
| 1.012786634 | 1 | < 0.1% |
| 1.309052847 | 1 | < 0.1% |
| 2.908194128 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4.519284805 | 1 | < 0.1% |
| 4.605455254 | 1 | < 0.1% |
| 5.006695621 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 425000000 | 1 | |
| 368371256.2 | 1 | |
| 315500574.8 | 1 | |
| 292050672.7 | 1 | |
| 271692064.2 | 1 | |
| 271330494.3 | 1 | |
| 260000000 | 1 | |
| 257599886.7 | 1 | |
| 254100108.5 | 1 | |
| 250419201.7 | 1 |
revenue_adj
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 4840 |
|---|---|
| Distinct (%) | 44.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51364363 |
| Minimum | 0 |
|---|---|
| Maximum | 2.8271238 × 109 |
| Zeros | 6016 |
| Zeros (%) | 55.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 85.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33697096 |
| 95-th percentile | 2.7655444 × 108 |
| Maximum | 2.8271238 × 109 |
| Range | 2.8271238 × 109 |
| Interquartile range (IQR) | 33697096 |
Descriptive statistics
| Standard deviation | 1.4463249 × 108 |
|---|---|
| Coefficient of variation (CV) | 2.8158138 |
| Kurtosis | 63.379908 |
| Mean | 51364363 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.2512021 |
| Sum | 5.5812517 × 1011 |
| Variance | 2.0918556 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6016 | |
| 117753430.8 | 2 | < 0.1% |
| 14389144.83 | 2 | < 0.1% |
| 1000000 | 2 | < 0.1% |
| 29106404.28 | 2 | < 0.1% |
| 31721459 | 2 | < 0.1% |
| 967000 | 2 | < 0.1% |
| 81036423.44 | 2 | < 0.1% |
| 57667591.03 | 2 | < 0.1% |
| 26331569.65 | 2 | < 0.1% |
| Other values (4830) | 4832 |
| Value | Count | Frequency (%) |
| 0 | 6016 | |
| 2.37070529 | 1 | < 0.1% |
| 2.861933734 | 1 | < 0.1% |
| 3.038359901 | 1 | < 0.1% |
| 5.926763224 | 1 | < 0.1% |
| 6.951083695 | 1 | < 0.1% |
| 8.585801203 | 1 | < 0.1% |
| 9.05681977 | 1 | < 0.1% |
| 9.115079704 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2827123750 | 1 | |
| 2789712242 | 1 | |
| 2506405735 | 1 | |
| 2167324901 | 1 | |
| 1907005842 | 1 | |
| 1902723130 | 1 | |
| 1791694309 | 1 | |
| 1583049536 | 1 | |
| 1574814740 | 1 | |
| 1443191435 | 1 |
| id | popularity | budget | revenue | runtime | vote_count | vote_average | release_year | budget_adj | revenue_adj | |
|---|---|---|---|---|---|---|---|---|---|---|
| id | 1.000 | -0.288 | -0.330 | -0.359 | -0.247 | -0.275 | -0.131 | 0.691 | -0.360 | -0.376 |
| popularity | -0.288 | 1.000 | 0.568 | 0.619 | 0.245 | 0.780 | 0.152 | 0.028 | 0.562 | 0.610 |
| budget | -0.330 | 0.568 | 1.000 | 0.708 | 0.313 | 0.611 | 0.066 | -0.033 | 0.994 | 0.695 |
| revenue | -0.359 | 0.619 | 0.708 | 1.000 | 0.330 | 0.682 | 0.186 | -0.088 | 0.710 | 0.997 |
| runtime | -0.247 | 0.245 | 0.313 | 0.330 | 1.000 | 0.247 | 0.249 | -0.180 | 0.324 | 0.334 |
| vote_count | -0.275 | 0.780 | 0.611 | 0.682 | 0.247 | 1.000 | 0.271 | 0.094 | 0.604 | 0.671 |
| vote_average | -0.131 | 0.152 | 0.066 | 0.186 | 0.249 | 0.271 | 1.000 | -0.098 | 0.079 | 0.193 |
| release_year | 0.691 | 0.028 | -0.033 | -0.088 | -0.180 | 0.094 | -0.098 | 1.000 | -0.084 | -0.123 |
| budget_adj | -0.360 | 0.562 | 0.994 | 0.710 | 0.324 | 0.604 | 0.079 | -0.084 | 1.000 | 0.704 |
| revenue_adj | -0.376 | 0.610 | 0.695 | 0.997 | 0.334 | 0.671 | 0.193 | -0.123 | 0.704 | 1.000 |
| id | imdb_id | popularity | budget | revenue | original_title | cast | homepage | director | tagline | keywords | overview | runtime | genres | production_companies | release_date | vote_count | vote_average | release_year | budget_adj | revenue_adj | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 135397 | tt0369610 | 32.985763 | 150000000 | 1513528810 | Jurassic World | Chris Pratt|Bryce Dallas Howard|Irrfan Khan|Vincent D'Onofrio|Nick Robinson | http://www.jurassicworld.com/ | Colin Trevorrow | The park is open. | monster|dna|tyrannosaurus rex|velociraptor|island | Twenty-two years after the events of Jurassic Park, Isla Nublar now features a fully functioning dinosaur theme park, Jurassic World, as originally envisioned by John Hammond. | 124 | Action|Adventure|Science Fiction|Thriller | Universal Studios|Amblin Entertainment|Legendary Pictures|Fuji Television Network|Dentsu | 6/9/15 | 5562 | 6.5 | 2015 | 1.379999e+08 | 1.392446e+09 |
| 1 | 76341 | tt1392190 | 28.419936 | 150000000 | 378436354 | Mad Max: Fury Road | Tom Hardy|Charlize Theron|Hugh Keays-Byrne|Nicholas Hoult|Josh Helman | http://www.madmaxmovie.com/ | George Miller | What a Lovely Day. | future|chase|post-apocalyptic|dystopia|australia | An apocalyptic story set in the furthest reaches of our planet, in a stark desert landscape where humanity is broken, and most everyone is crazed fighting for the necessities of life. Within this world exist two rebels on the run who just might be able to restore order. There's Max, a man of action and a man of few words, who seeks peace of mind following the loss of his wife and child in the aftermath of the chaos. And Furiosa, a woman of action and a woman who believes her path to survival may be achieved if she can make it across the desert back to her childhood homeland. | 120 | Action|Adventure|Science Fiction|Thriller | Village Roadshow Pictures|Kennedy Miller Productions | 5/13/15 | 6185 | 7.1 | 2015 | 1.379999e+08 | 3.481613e+08 |
| 2 | 262500 | tt2908446 | 13.112507 | 110000000 | 295238201 | Insurgent | Shailene Woodley|Theo James|Kate Winslet|Ansel Elgort|Miles Teller | http://www.thedivergentseries.movie/#insurgent | Robert Schwentke | One Choice Can Destroy You | based on novel|revolution|dystopia|sequel|dystopic future | Beatrice Prior must confront her inner demons and continue her fight against a powerful alliance which threatens to tear her society apart. | 119 | Adventure|Science Fiction|Thriller | Summit Entertainment|Mandeville Films|Red Wagon Entertainment|NeoReel | 3/18/15 | 2480 | 6.3 | 2015 | 1.012000e+08 | 2.716190e+08 |
| 3 | 140607 | tt2488496 | 11.173104 | 200000000 | 2068178225 | Star Wars: The Force Awakens | Harrison Ford|Mark Hamill|Carrie Fisher|Adam Driver|Daisy Ridley | http://www.starwars.com/films/star-wars-episode-vii | J.J. Abrams | Every generation has a story. | android|spaceship|jedi|space opera|3d | Thirty years after defeating the Galactic Empire, Han Solo and his allies face a new threat from the evil Kylo Ren and his army of Stormtroopers. | 136 | Action|Adventure|Science Fiction|Fantasy | Lucasfilm|Truenorth Productions|Bad Robot | 12/15/15 | 5292 | 7.5 | 2015 | 1.839999e+08 | 1.902723e+09 |
| 4 | 168259 | tt2820852 | 9.335014 | 190000000 | 1506249360 | Furious 7 | Vin Diesel|Paul Walker|Jason Statham|Michelle Rodriguez|Dwayne Johnson | http://www.furious7.com/ | James Wan | Vengeance Hits Home | car race|speed|revenge|suspense|car | Deckard Shaw seeks revenge against Dominic Toretto and his family for his comatose brother. | 137 | Action|Crime|Thriller | Universal Pictures|Original Film|Media Rights Capital|Dentsu|One Race Films | 4/1/15 | 2947 | 7.3 | 2015 | 1.747999e+08 | 1.385749e+09 |
| 5 | 281957 | tt1663202 | 9.110700 | 135000000 | 532950503 | The Revenant | Leonardo DiCaprio|Tom Hardy|Will Poulter|Domhnall Gleeson|Paul Anderson | http://www.foxmovies.com/movies/the-revenant | Alejandro González Iñárritu | (n. One who has returned, as if from the dead.) | father-son relationship|rape|based on novel|mountains|winter | In the 1820s, a frontiersman, Hugh Glass, sets out on a path of vengeance against those who left him for dead after a bear mauling. | 156 | Western|Drama|Adventure|Thriller | Regency Enterprises|Appian Way|CatchPlay|Anonymous Content|New Regency Pictures | 12/25/15 | 3929 | 7.2 | 2015 | 1.241999e+08 | 4.903142e+08 |
| 6 | 87101 | tt1340138 | 8.654359 | 155000000 | 440603537 | Terminator Genisys | Arnold Schwarzenegger|Jason Clarke|Emilia Clarke|Jai Courtney|J.K. Simmons | http://www.terminatormovie.com/ | Alan Taylor | Reset the future | saving the world|artificial intelligence|cyborg|killer robot|future | The year is 2029. John Connor, leader of the resistance continues the war against the machines. At the Los Angeles offensive, John's fears of the unknown future begin to emerge when TECOM spies reveal a new plot by SkyNet that will attack him from both fronts; past and future, and will ultimately change warfare forever. | 125 | Science Fiction|Action|Thriller|Adventure | Paramount Pictures|Skydance Productions | 6/23/15 | 2598 | 5.8 | 2015 | 1.425999e+08 | 4.053551e+08 |
| 7 | 286217 | tt3659388 | 7.667400 | 108000000 | 595380321 | The Martian | Matt Damon|Jessica Chastain|Kristen Wiig|Jeff Daniels|Michael Peña | http://www.foxmovies.com/movies/the-martian | Ridley Scott | Bring Him Home | based on novel|mars|nasa|isolation|botanist | During a manned mission to Mars, Astronaut Mark Watney is presumed dead after a fierce storm and left behind by his crew. But Watney has survived and finds himself stranded and alone on the hostile planet. With only meager supplies, he must draw upon his ingenuity, wit and spirit to subsist and find a way to signal to Earth that he is alive. | 141 | Drama|Adventure|Science Fiction | Twentieth Century Fox Film Corporation|Scott Free Productions|Mid Atlantic Films|International Traders|TSG Entertainment | 9/30/15 | 4572 | 7.6 | 2015 | 9.935996e+07 | 5.477497e+08 |
| 8 | 211672 | tt2293640 | 7.404165 | 74000000 | 1156730962 | Minions | Sandra Bullock|Jon Hamm|Michael Keaton|Allison Janney|Steve Coogan | http://www.minionsmovie.com/ | Kyle Balda|Pierre Coffin | Before Gru, they had a history of bad bosses | assistant|aftercreditsstinger|duringcreditsstinger|evil mastermind|minions | Minions Stuart, Kevin and Bob are recruited by Scarlet Overkill, a super-villain who, alongside her inventor husband Herb, hatches a plot to take over the world. | 91 | Family|Animation|Adventure|Comedy | Universal Pictures|Illumination Entertainment | 6/17/15 | 2893 | 6.5 | 2015 | 6.807997e+07 | 1.064192e+09 |
| 9 | 150540 | tt2096673 | 6.326804 | 175000000 | 853708609 | Inside Out | Amy Poehler|Phyllis Smith|Richard Kind|Bill Hader|Lewis Black | http://movies.disney.com/inside-out | Pete Docter | Meet the little voices inside your head. | dream|cartoon|imaginary friend|animation|kid | Growing up can be a bumpy road, and it's no exception for Riley, who is uprooted from her Midwest life when her father starts a new job in San Francisco. Like all of us, Riley is guided by her emotions - Joy, Fear, Anger, Disgust and Sadness. The emotions live in Headquarters, the control center inside Riley's mind, where they help advise her through everyday life. As Riley and her emotions struggle to adjust to a new life in San Francisco, turmoil ensues in Headquarters. Although Joy, Riley's main and most important emotion, tries to keep things positive, the emotions conflict on how best to navigate a new city, house and school. | 94 | Comedy|Animation|Family | Walt Disney Pictures|Pixar Animation Studios|Walt Disney Studios Motion Pictures | 6/9/15 | 3935 | 8.0 | 2015 | 1.609999e+08 | 7.854116e+08 |
| id | imdb_id | popularity | budget | revenue | original_title | cast | homepage | director | tagline | keywords | overview | runtime | genres | production_companies | release_date | vote_count | vote_average | release_year | budget_adj | revenue_adj | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10856 | 20277 | tt0061135 | 0.140934 | 0 | 0 | The Ugly Dachshund | Dean Jones|Suzanne Pleshette|Charles Ruggles|Kelly Thordsen|Parley Baer | NaN | Norman Tokar | A HAPPY HONEYMOON GOES TO THE DOGS!...When a Great Dane disguised as a Dachsie crashes the party! | great dane|dachshund | The Garrisons (Dean Jones and Suzanne Pleshette) are the "proud parents" of three adorable dachshund pups -- and one overgrown Great Dane named Brutus, who nevertheless thinks of himself as a dainty dachsie. His identity crisis results in an uproarious series of household crises that reduce the Garrisons' house to shambles -- and viewers to howls of laughter! | 93 | Comedy|Drama|Family | Walt Disney Pictures | 2/16/66 | 14 | 5.7 | 1966 | 0.000000 | 0.0 |
| 10857 | 5921 | tt0060748 | 0.131378 | 0 | 0 | Nevada Smith | Steve McQueen|Karl Malden|Brian Keith|Arthur Kennedy|Suzanne Pleshette | NaN | Henry Hathaway | Some called him savage- and some called him saint... some felt his hate- and one found his love... and three had to die... | repayment|revenge|native american|wild west|half breed | Nevada Smith is the young son of an Indian mother and white father. When his father is killed by three men over gold, Nevada sets out to find them and kill them. The boy is taken in by a gun merchant. The gun merchant shows him how to shoot and to shoot on time and correct. | 128 | Action|Western | Paramount Pictures|Solar Productions|Embassy Pictures | 6/10/66 | 10 | 5.9 | 1966 | 0.000000 | 0.0 |
| 10858 | 31918 | tt0060921 | 0.317824 | 0 | 0 | The Russians Are Coming, The Russians Are Coming | Carl Reiner|Eva Marie Saint|Alan Arkin|Brian Keith|Paul Ford | NaN | Norman Jewison | IT'S A PLOT! ...to make the world die laughing!! | cold war|russian|new england | Without hostile intent, a Soviet sub runs aground off New England. Men are sent for a boat, but many villagers go into a tizzy, risking bloodshed. | 126 | Comedy|War | The Mirisch Corporation | 5/25/66 | 11 | 5.5 | 1966 | 0.000000 | 0.0 |
| 10859 | 20620 | tt0060955 | 0.089072 | 0 | 0 | Seconds | Rock Hudson|Salome Jens|John Randolph|Will Geer|Jeff Corey | NaN | John Frankenheimer | NaN | plastic surgery|suspense | A secret organisation offers wealthy people a second chance at life. The customer picks out someone they want to be and the organisation surgically alters the customer to look like the intended person, stages the customer's death, gets rid of the intended person and the customer takes on a new life. | 100 | Mystery|Science Fiction|Thriller|Drama | Gibraltar Productions|Joel Productions|John Frankenheimer Productions Inc. | 10/5/66 | 22 | 6.6 | 1966 | 0.000000 | 0.0 |
| 10860 | 5060 | tt0060214 | 0.087034 | 0 | 0 | Carry On Screaming! | Kenneth Williams|Jim Dale|Harry H. Corbett|Joan Sims|Charles Hawtrey | NaN | Gerald Thomas | Carry On Screaming with the Hilarious CARRY ON Gang!! | monster|carry on|horror spoof | The sinister Dr Watt has an evil scheme going. He's kidnapping beautiful young women and turning them into mannequins to sell to local stores. Fortunately for Dr Watt, Detective-Sergeant Bung is on the case, and he doesn't have a clue! In this send up of the Hammer Horror movies, there are send-ups of all the horror greats from Frankenstein to Dr Jekyl and Mr Hyde. | 87 | Comedy | Peter Rogers Productions|Anglo-Amalgamated Film Distributors | 5/20/66 | 13 | 7.0 | 1966 | 0.000000 | 0.0 |
| 10861 | 21 | tt0060371 | 0.080598 | 0 | 0 | The Endless Summer | Michael Hynson|Robert August|Lord 'Tally Ho' Blears|Bruce Brown|Chip Fitzwater | NaN | Bruce Brown | NaN | surfer|surfboard|surfing | The Endless Summer, by Bruce Brown, is one of the first and most influential surf movies of all times. The film documents American surfers Mike Hynson and Robert August as they travel the world during California’s winter (which back in 1965 was off-season for surfing) in search of the perfect wave and an endless summer. | 95 | Documentary | Bruce Brown Films | 6/15/66 | 11 | 7.4 | 1966 | 0.000000 | 0.0 |
| 10862 | 20379 | tt0060472 | 0.065543 | 0 | 0 | Grand Prix | James Garner|Eva Marie Saint|Yves Montand|ToshirÅ Mifune|Brian Bedford | NaN | John Frankenheimer | Cinerama sweeps YOU into a drama of speed and spectacle! | car race|racing|formula 1 | Grand Prix driver Pete Aron is fired by his team after a crash at Monaco that injures his teammate, Scott Stoddard. While Stoddard struggles to recover, Aron begins to drive for another team, and starts dating Stoddard's wife. | 176 | Action|Adventure|Drama | Cherokee Productions|Joel Productions|Douglas & Lewis Productions | 12/21/66 | 20 | 5.7 | 1966 | 0.000000 | 0.0 |
| 10863 | 39768 | tt0060161 | 0.065141 | 0 | 0 | Beregis Avtomobilya | Innokentiy Smoktunovskiy|Oleg Efremov|Georgi Zhzhyonov|Olga Aroseva|Lyubov Dobrzhanskaya | NaN | Eldar Ryazanov | NaN | car|trolley|stealing car | An insurance agent who moonlights as a carthief steals cars various crooks and never from the common people. He sells the stolen cars and gives the money to charity. His best friend, a cop, is assigned to bring in this modern robin hood. | 94 | Mystery|Comedy | Mosfilm | 1/1/66 | 11 | 6.5 | 1966 | 0.000000 | 0.0 |
| 10864 | 21449 | tt0061177 | 0.064317 | 0 | 0 | What's Up, Tiger Lily? | Tatsuya Mihashi|Akiko Wakabayashi|Mie Hama|John Sebastian|Tadao Nakamaru | NaN | Woody Allen | WOODY ALLEN STRIKES BACK! | spoof | In comic Woody Allen's film debut, he took the Japanese action film "International Secret Police: Key of Keys" and re-dubbed it, changing the plot to make it revolve around a secret egg salad recipe. | 80 | Action|Comedy | Benedict Pictures Corp. | 11/2/66 | 22 | 5.4 | 1966 | 0.000000 | 0.0 |
| 10865 | 22293 | tt0060666 | 0.035919 | 19000 | 0 | Manos: The Hands of Fate | Harold P. Warren|Tom Neyman|John Reynolds|Diane Mahree|Stephanie Nielson | NaN | Harold P. Warren | It's Shocking! It's Beyond Your Imagination! | fire|gun|drive|sacrifice|flashlight | A family gets lost on the road and stumbles upon a hidden, underground, devil-worshiping cult led by the fearsome Master and his servant Torgo. | 74 | Horror | Norm-Iris | 11/15/66 | 15 | 1.5 | 1966 | 127642.279154 | 0.0 |
Most frequently occurring
| id | imdb_id | popularity | budget | revenue | original_title | cast | homepage | director | tagline | keywords | overview | runtime | genres | production_companies | release_date | vote_count | vote_average | release_year | budget_adj | revenue_adj | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 42194 | tt0411951 | 0.59643 | 30000000 | 967000 | TEKKEN | Jon Foo|Kelly Overton|Cary-Hiroyuki Tagawa|Ian Anthony Dale|Luke Goss | NaN | Dwight H. Little | Survival is no game | martial arts|dystopia|based on video game|martial arts tournament | In the year of 2039, after World Wars destroy much of the civilization as we know it, territories are no longer run by governments, but by corporations; the mightiest of which is the Mishima Zaibatsu. In order to placate the seething masses of this dystopia, Mishima sponsors Tekken, a tournament in which fighters battle until only one is left standing. | 92 | Crime|Drama|Action|Thriller|Science Fiction | Namco|Light Song Films | 3/20/10 | 110 | 5.0 | 2010 | 30000000.0 | 967000.0 | 2 |